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ESCHERICHIA COLI STRAIN SECRETING HUMA|T GRANULOCYTE 
COLONY STIMULATING FACTOR (G-CSF) 



5 BACKGROUND OF THE INVENTION 

Field of the In vention 

The present invention relates to an jEJ. coli producing 
10 and secreting human granulocyte-colony stimulating 
factor (hG-CSF) ^ more specifically, to a recombinant plasmid 
constructed to express secretory hG-CSF in E. coli, an E. 
coli transformed with the said plasmid to secrete hG-CSF, 
and a process for preparing hG-CSF using the said 
15 transformed £?. coli. 

Description of the Prior Art 

Colony stimulating f actors (CSFs) known to be 

20 synthesized by various cell types such as mononuclear 
macrophages, T-lymphocytes, and fibroblasts are found in 
the various parts of noirmal human body. CSFs are 
classified into three major categories, granulocyte-colony 
stimulating factor (G-CSF) , macrophage-colony stimulating 

25 factor (M-CSF) and granulocyte/macrophage-colony stimulating 
factor (GM-CSF) , among them, G-CSF is an essential protein 
in manufacturing blood cells via promoting proliferation 
and differentiation of hemopoietic stem cells, and 
facilitates increase in numbers of granulocytes, especially, 

30 neutrophils which play an important role in the protection 
of the body from the infection. Chemotherapies widely 
used to treat growing tumors not only inhibit the growth of 
tumors but also inhibit the production of neutrophils, 
giving rise to severe side effects due to the diminished 

35 protecting function of neutrophils. Administration of G- 
CSF to the patients under such chemotherapies is known to 
be an effective way of treatment and prevention of the 
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infectious diseases by means of facilitating the increase 
in neurophil numbers. 

In 198 6, the hG-CSF gene was isolated from a human 
squamous carcinoma cell line CHU-II, its nucleotide 
5 sequence was first determined and expressed in COS cells by 
Nagata et al. [^^: Nagata et al.. Nature 319: 415 (1986)). 
The hG-CSF is a glycoprotein comprising 30 signal peptides 
which consists of 174 amino acid residues. The hG-CSF 
includes 5 cysteine residues of which 4 cysteines form two 

10 disulfide bonds, between Cys--36 and Cys-64, and between 
Cys-64 and Cys-74, which serve for folding of the expressed 
protein and its activity (sjaa: Hill et al., Proc. Natl. Acad. 
Sci.r USA, 90;5167-5171 (1993) ) . The hG-CSF dose not have 
the consensus sequence (Asn-X-Thr/Ser) for N-glycosylation, 

15 but 0-glycosylation occurs at Thr-133. However, the 
recombinant G~CSF produced in E. coll is known to have 
almost the same biological activities as natural G-CSF, 
which means glycosylation is unnecessary for the G-CSF 
activity. 

20 With the recent progress in recombinant DNA 

technology, G-CSF can be produced in bacteria, plant cells 
and animal cells, and some results previously reported are 
described here: Souza et al. isolated a cDNA from the human 
bladder cancer cell line 5637, determined its sequence and 

25 reported its expression in coli(,ae^: Souza et al.. 

Science, 232:61(1986)). Moreover, researches on production 
of hG-CSF in E. coli, plant cells and animal cells, and on 
construction and production of hG-CSF derivatives have been 
reported. However, technologies known to date for the 

30 production of hG-CSF in E. coll have many disadvantages in 
terms of protein yield or production cost since the hG-CSF 
is produced in cytoplasm in the form of insoluble inclusion 
body, . which requires subsequent solubilization and 
renaturation to obtain biologically active form of hG-CSF 

35 protein. Although small quantity of soluble hG-CSF can be 
isolated directly, such method still have limitations in a 
sense that the active fraction of hG-CSF protein has to be 
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isolated from the pool of enormous amounts of E. coli 
proteins. 

In general, proteins secreted to the periplasm ^of 
coli carry signal sequence, which is found in all proteins 
transportable out of the cytoplasm, and cleaved off by 
signal peptidase in the periplasm. The signal sequence is 
essential in secreting proteins in E. coli. Therefore, 
recombinant proteins originally not encoded by E. coli 
genes can be secreted into the periplasm or to the 
extracellular broth by joining known signal sequence (OmpA, 
OmpF, PelB, PhoA, SpA, etc.)/ as it is or with slight 
modifications, to the N-teorminus of gene coding an 
exogenous protein. 

The method for production of hG~CSF by secretion into 
the periplasinic space has following advantages over the 
conventional method by producing in .cytoplasm described 
above: first, it is easy to isolate and purify the 
recombinant proteins, with high purity, in periplasm than 
in cytoplasm, since there are fewer proteins in periplasm 
than in cytoplasm Nossal, N.G, et al., J. Biol. Chem. , 
241:3055-3062(1966)); secondly, recombinant proteins 
secreted into periplasm are segregated from the cytoplasm 
where the most proteases are found, obtaining high yield in 
production of recombinant protein by avoiding degradation 
of the protein by proteases present in cytoplasm (.aa^: 
Meerman and Georgiou, Ann. . N.Y. Acad* Sci., 721:292- 
302(1994)); thirdly, the bacterial periplasm is more 
oxidizing environment than cytoplasm, conducting disulfide 
bond formation and correct folding of polypeptide, easily, 
thus, the formation of insoluble aggregates is 
avoidable (.SJ^: Hockney, TIBTECH, 12:456-463 (1994)). 

Having such advantages, the method resulting in 
secretion of recombinant proteins into the periplasm has 
been employed for production of hG-CSF in E, coli and 
reported as follows: Perez-Perez et al. have tried to get 
secreted form of hG-CSF, employing OmpA which is one of the 
signal sequences in E. coli, without success. To solve that 
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problem^, they employed the system for coexpression of two 
molecular chaperones, DnaK and DnaJ, "and merely obtain a 
small quantity of secreted hG--CSF {.ae^: Perez-Perez et al . , 
Biochem. Biophys, Res. Commun. , 210 : 524-529 (1995) ); and, 
Chung et al. have tried to obtain . secreted form of hG-CSF 
employing another signal sequence, PelB, again without 
success, but hG-CSF was accumulated in the form of 
insoluble inclusion body in cytoplasm Chung et al-, l7. 
FerjTzen. BioBxigin. , 85:443-446(1998)). 

In view of above situation, there is a continuing 
need to develop the technique for facilitating secretion of 
.hG-CSF at a substantial level into the periplasm through 
rather simple process which does not require solubilization 
of insoluble inclusion body and subsequent refolding. 

SUMMARY OF THE INVENTION 

The present inventors have made an effort to develop 
the technique for facilitating secretion of hG-CSF into the 
periplasm, thus, they have found that E. coll, transformed 
with the recombinant plasmid containing nucleotide sequence 
coding for oligopeptide consisting of 13 amino acids, 
Bascillus-derived endoxylanase signal sequence and 6 
histidine residues, can secrete oligopeptide/hG-CSF fusion 
protein efficiently so that hG-CSF protein can be produced 
using the said transformed E. coll. 

The first object of the present invention is, 
therefore, to provide the recombinant plasmid constructed 
for E, coll to express and secrete hG-CSF fusion protein 
therefrom. 

The second object of the invention is to provide 
transformed E. coll with the said plasmid. 

The third of the invention is to provide a process 
for producing hG-CSF protein employing the said 
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BRIEF DESCRIPTION OF THE DRAWINGS 

5 The above/ the other objects and features of the 

invention will become apparent from the following 
descriptions given in conjunction with the accompanying 
drawings, in which: 

10 Figure 1 ^is the nucleotide sequence of hG-CSF gene 

inserted into the plasmid plSCSF. 
Figure 2 shows a construction scheme and a genetic 

map for the plasmid pl9CSFm- 
Figure 3 is the nucleotide sequence and the deduced 
15 amino acid sequence of hG-CSF gene 

inserted into the plasmid pl9CSFm. 
Figure 4 shows a construction scheme and a genetic 

map for the plasmid pEDCSFm. 
Figure 5 is the nucleotide sequence and the deduced 
20 amino acid sequence of hG-CSF.. gene 

inserted into the plasmid pEDCSFm. 
Figure 6 shows a construction scheme and a genetic 

map for the plasmid pEDCSFmll. 
Figure 7 is the nucleotide sequence and the deduced 
25 amino acid sequence of the N-terminal 

portion of hG-CSF gene inserted into the 
plasmid pEDCSFmll. 
Figure 8 shows a construction scheme and a genetic 

map for the plasmid pTrcSCSFmll. 
30 Figure 9 is the nucleotide sequence and the deduced 

amino acid sequence of the N-terminal 
portion of hG-CSF gene . inserted into the 
plasmid pTrcSCSFmll. 
Figure 10 shows a construction scheme and a genetic 
35 map for the plasmid pTrcKCSFmll- 

Figure 11 shows a construction scheme and a genetic 

map for the plasmid pTHSCSFmll. 
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Figure 12 is the nucleotide sequence and the deduced 

amino acid sequence of the N-terminal 
portion of hG-CSF gene inserted into the 
plasmid pTHSCSFmll. 
5 Figure 13 shows a construction scheme and a genetic 

map for the plasmid pTHKCSFmll. 

DETAILED DESCRIPTION OF THE INVENTION 

10 The recombinant plasmid of the invention designed for 

secreting hG-CSF fusion protein efficiently from E. coll 
contains: a kanamycin resistance gene, an endoxylanase 
signal sequence, a nucleotide sequence coding for an 
oligopeptide consisting of 13 amino acid residues including 

15 6 consecutive histidine residues, a modified hG-CSF gene 
and Trc promoter- The recombinant E. coll of the invention 
secreting hG-CSF protein efficiently is prepared by 
transformation of E» coll with the said recombinant plasmid - 
The hG-CSF protein can be prepared with high purity by 

20 isolating the hG-CSF fusion protein from the protein pool 
obtained from the transformed E. coll using a Ni-column and 
subsequent protease treatment. 

The present invention is illustrated in more detail 
25 as followings. . ' * 

First, the recombinant plasmid vector pTHKCSFmll, 
containing cDNA encoding hG-CSF protein, endoxylanase 
signal sequence for hG-CSF protein secretion, Trc promoter 

30 which is a strong inducible promoter, kanamycin resistance 
gene encoding non-secretory protein, and a nucleotide 
sequence encoding an oligopeptide consisting of 13 amino 
acid residues including 6 consecutive histidine residues 
between endoxylanase signal sequence and hG-CSF polypeptide, 

35 is constructed for preparing transformed E. coll which 
expresses and secretes hG-CSF fusion protein into the 
periplasm: wherein, cDNA -encoding hG-CSF protein is 
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obtained by joining 3rd exon-deleted hG-CSF cDNA obtained 
from a human breast carcinoma cDNA library to the 
synthesized 3rd exon^^ Bacillus sp. -derived endoxylanase 
signal sequence is employed for facilitating secretion of 
hG-CSF protein from E. coli, and kanamycin resistance gene 
is employed as a selection marker. Furthermore, to prevent 
cell lysis occurred during the secretion of hG-CSF protein 
in E. coli, a nucleotide sequence encoding an oligopeptide 
consisting of 13 amino acid residues including 6 
consecutive histidine residues was included between DNA of 
endoxylanase signal sequence and hG-CSF gene, and the amino 
acid sequence of the oligopeptide is N ' -Ala-Gly-Pro-His- 
His-His-His-His-His-Ile-*Glu-Gly-Arg-C' (SEQ ID NO: 1) . 

Subsequently, recombinant E. coli secreting hG-CSF 
fusion protein is prepared by introducing the plasmid 
pTHKCSFmll constructed above into £- coli: wherein, E. coli 
' strains BL21(DE3) , HBlOl, MC4100, W3110 and XLl-Blue, 
preferably MC4100, can be used. coli MC4100 transformed 

with the recombinant plasmid pTHKCSFmll was named E. coli 
MC4100 /pTHKCSFmll {Escherichia coli MC4100/pTHKCSFmII) , 
which was deposited with the Korean .Collection for Type 
Cultures (KCTC) affiliated to Korea Research Institute of 
Bioscience and Biotechnology (KRIBB) , . an international 
depository authority, under accession (deposition) No. KCTC 
0754BP on Mar. 13, 2000. 

Finally, the complete hG-CSF protein is prepared from 
the protein pool obtained from the transformed E, coli 
using a Ni-column and a protease; the said £. coli secretes 
hG-CSF fusion protein of which N-terminus is linked by an 
oligopeptide consisting of 13 amino acid residues including 
6 consecutive histidine residues. The secreted hG-CSF 
fusion protein is isolated using a Ni-colxamn to which 6 
consecutive histidine residues present in the oligopeptide 
of the fusion protein can bind, and then the complete hG- 
CSF protein can be prepared from the hG-CSF fusion protein 
isolated above by treating a protease to get rid of the 
oligopeptide. Since the hG-CSF protein has to be non- 
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susceptible to the protease employed, the C-terminal 
sequence of the oligopeptide should be selected to be 
cleaved off by the protease of which recognition sequences 
are not present in the hG-CSF protein. As an example, in 
the present invention, the C-terminal amino acid sequence 
of the oligopeptide was selected to be Ile-Glu-Gly-Arg (.a^: 
SEQ ID NO: 1) , which is recognized and cleaved by Factor Xa, 
a protease not having recognition sequences in hG-CSF 
protein. 

The present invention is further illustrated in the 
following examples , - which should not be taken to lim.it the 
scope of the invention. 

Ki<Rm pl& 1 : Preparation of cDNA for hioman granulocyte-colony 

stimulating factor (hG-CSF) 

To isolate hG-CSF gene, hG-CSF cDNA was prepared by 
PGR amplification of a human breast carcinoma cDNA library. 
Nucleotide sequence of the hG-CSF gene was retrieved in the 
GenBank, and * then, primer 1: 

GCGAATTCATGGCTGGACCTGCCACCCAG-3' (SEQ ID NO: 2) and primer 
2: 5'-GCGGATCCTTATTAGGGCTGGGCAAGGTGGCG-3' (SEQ ID NO: 3) 
were synthesized, respectively- For the convenience of 
cloning of PGR product, EcoRl and SamHI restriction sites 
were introduced at primer 1 and primer 2, respectively. 
Then, PGR was performed by, employing High Fidelity PGR 
System (Boehringer Mannheim Co., Germany), under following 
condition: one cycle of denaturation at 94 ''C for 7 min; 30 
cycles of denaturation at 94*^0 for 1 min, annealing at 50 "C 
for 1 min, and extension at 72 1) for 1 min; and, one cycle 
of extension at 72 'C for 7 min. DNA molecules obtained by 
the PGR were subjected to agarose gel electrophoresis to 
isolate DNA fragment of approximately 530bp, which was 
subsequently digested with JScoRI and BamRl to obtain DNA 

fragment concerned. 

In order to clone the DNA fragment obtained above. 
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the said DNA fragment was joined into the plasmid 
pUC19{^je:^: Yanisch-Perron et al . , Gene, 33:109-119(1985)) 
digested with EcoRI and JBai22HI using T4 DNA ligase, and then, 
the ligation product was introduced into coli XLl- 

Blue(sup£;44 hsdR17 r&cAl endAl gyrA96 thi relAl 
lacF' (proAB-i- lacIqlacZA M15Tn(tetr) ) ) ( see : Bullocic, O, 
et al., BioTechniques, 5:376-378(1987)) by electroporation 
technique to obtain transformed coli. The trans foirtuants 
were selected on the LB agar medium containing 

ampicillin (50 /ig/-C ) and the recombinant plasmid pl9CSF was 
obtained therefrom. The nucleotide sequence of the 
fragment in pl9CSF was determined using automatic DNA 
sequencer (ABI Prism model 377, Perkin Elmer Co., U.S.A.). 
It has been found that there was 108bp deletion in the 
middle of the cloned fragment of the invention compare to 
the published nucleotide sequence of hG-CSF gene, meanwhile, 
the rest sequence of .508bp was identical to the published 
sequence of hG-CSF gene (.a^: Figure 1). Comparing with 
the sequence of genomic DNA of hG-CSF, the 108bp deletion 
in the fragment of the invention corresponded to the third 
exon among five exons of hG-CSF gene. 

In order to obtain complete sequence of the hG-CSF 
gene, _ four primers, primer 3: 5'- 

TCCTCGGGGTGGCACAGCTTGTAGGTGGCACACAGCTTCTCCTGGAGCGC-3 ' (SEQ 
ID NO: 4), primer .4: 5'- 

GCTGTGCCACCCCGAGGAGCTGGTGCTGCTCGGACACTCTCTGGGCATCC-3 ' (SEQ 
ID NO: 5), primer 5: 5'- 

TGGCTGGGGCAGCTGCTCAGGGGAGCCCAGGGGATGCCCAGAGAGTGTC~3 ' (SEQ ID 
NO: 6), and primer 6: 5'- 

AGCAGCTGCCCCAGCCAGGCCCTGCAGCTGGCAGGCTGCTTGAGCCAA-3 ' ( SEQ ID 
NO: 7) corresponding the third exon were prepared and PCR 
was carried out. Using primer 1 and the PCR product 
obtained above, pl9CSF was subjected to PCR amplification, 
and with the primer 2, PCR was carried out in the same way 
as with primer 1. Using both PCR products, PCR was 
performed once again and the resulting product was digested 
with two restriction enzymes, JScoRI. and BamHI, and then the 
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fragments were cloned into the same site of pUC19 as the 
site used above. The transformed E. coli XLl-Blue were 
selected on the LB agar medium containing ampicillin (50 /zg/ 
H ) and the recombinant plasmid pl9CSFm was obtained 
therefrom (s^: Figure 2). The nucleotide sequence of the 
fragment in pl9CSFm was determined and found to be 
identical to the sequence of the published hG-CSF gene (.s^: 
Figure 3) • 

F,x;qm ple 2 : Construction of the recombinant plasmid, pEDCSFm 

In order to get hG-CSF protein expressed from the hG- 
CSF gene obtained in Example 1, the mature hG-CSF gene was 
constructed by removing signal sequence from pl9CSFm(.s^: 
Figure 5) . To clone the mature hG-CSiF gene. Primer 7: 5'- 
GCGAATTCATATGACCCCCCTGGGCCCTGCCAGC--3' (SEQ ID NO: 8) was 
prepared. Employing primer 2 and primer 7, pl9CSFm was 
subjected to PCR amplification, and the amplified DNA 
molecules were digested with two restriction enzymes, Nde.l 
and BamRl, The resultant DNA fragment was joined into the 
T7 promoter-containing plasmid pET21c (Novagen Co., U.S.A.) 
digested with Ndel and BamRl. After transformation of E. 
coli XLl-Blue with the ligated product by electroporation, 
the transf ormants were selected on the LB agar medium 
containing ampicillin (50 //g/iJ ) and the recombinant plasmid 
pEDCSFm was obtained therefrom (.sss: Figure 4) . 

For production of hG-CSF protein in E. coli, the 
recombinant plasmid pEDCSFm was introduced into E. coli 
BL21(DE3)(F- ompT hsdSB(rB- mB-) gal dcm (DE3) a prophage 
carrying the T7 RNA polymerase gene). The transformed E. 
coli harboring the recombinant plasmid was inoculated into 
50ml of liquid LB medium in a 250ml flask and incubated at 
37 1^. When the cell concentration reached to OD600 of 
about 0.7, ImM IPTG (isopropyl-g -thiogalactoside) was added 
to the culture to induce the expression of hG-CSF gene. 
After 4 hour induction, 1ml of the culture broth was 
aliquot ed. The aliquot ed broth was centrifuged at a speed 
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of 6/OOOrpm for 5min at 4*C to collect cells, which were 
then washed, centrifuged again at a speed of 6,000rpm for 

5min at 4lC, and then resuspended in 0.2ml of TE buffer. 
The mixture of 64 iiS. aliquot of cell suspension and 16 iJSi of 
sample buf f er (Tris-HCl 60mM: 25% glycerol (v/v) : 2% 
SDS(v/v): 2-mercaptoethanol 14,4mM: 0.1% bromophenol blue) 

was heated at 100 for lOmin, and then subjected to SDS- 
PAGE (sodium-dodecyl sulfate polyacrylamide gel 

electrophoresis) in separating gel. After SDS-PAGE, the 
gel was stained in a staining solution (Coomassie brilliant 

blue R 0-25g/^ : methanol 40%(v/v), acetic acid 7% (v/v) ) 
for 2 hours, and destained two times in destaining 
solution (methanol 40% (v/v), acetic acid 7% (v/v)) for 2 
hours each time. The SDS-PAGE did not show the protein 
band corresponding to hG-CSF protein. 

Example 3 : Construction of the recombinant plasmid, 

pEDCSFmll 

Based on the report by Devlin et al. that high G+c 
content in N-terminal portion of hG-CSF gene has inhibitory 
effect on transcription and translation of the gene(.ss^: 
Devlin et al.. Gene, 65:13-22 (1988)), primer 8 : 5'- 
GCGAATTCATATGACTCCGTTAGGTCCAGCCAGC-3' (SEQ ID NO: 9) was 
prepared to obtain the gene which has lower G+c content 
such that the gene can be transcribed and translated 
efficiently in E* coll* 

Plasmid pl9CSFm was amplified by PGR using primer 8 
and primer 2- The resulting PGR product was digested with 
Ndel and BamHI, cloned into the same site of pET21c, and 
then transformed into E. colx XLl-Blue. The transformed E. 
coll XLl-Blue were selected on the LB agar medium 

containing ampicillin (50 //g/£ ) and the recombinant plasmid 
pEDCSFmll was obtained therefrom (.sss: Figure 6) . Comparing 
with the sequence of the fragment in pEDCSFm, the 
nucleotide sequence of the fragment in pEDCSFmll was found 
to be identical in N-terminal portion of the gene. 
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meanwhile, be different in 6 nucleotide residues overall, 
which give rise the changes in 5 codons (ACC CCC CTG GGC CCT 
-> ACT CCG TTA GGT CCA) . 

Plasmid pEDCSFmll was transformed into E. coli 
BL21(DE3) and its expression was analyzed- Culturing of 
transformed E. colir and production and analysis of total 
protein were carried out as described in Example 2, and 
fractionation of insoluble aggregate was performed as 
follows. Cells obtained by centrif ugation of 1ml of 
culture broth were suspended in 0.5ml of TE buf f er (Tris-HCl 
lOmM, EDTA ImM, pH 8.0), and then centrifuged again at a 
speed of 3,000xg for 5min at 4*0 followed by resuspension 
in 0.2ml of TE buffer. Cells were disrupted by using a 
ultrasonicator (Branson Ultrsonics Co., U.S.A.) and then 
centrifuged at a speed of 10,000xg for lOmin at 4^. The 
supernatant was classified as soluble proteins and the 
pellet was dissolved in 0.2ml of TE buffer and classified 
as insoluble .proteins. The content of hG-CSF protein 
expressed in E. coll BL21(DE3) transformed with plasmid 
pEDCSFmll was identified to be as much as about 40% of 
total protein, and, most of 'hG-CSF protein produced was 
found to be insoluble aggregates with no biological 
activity. 

Example 4 : Construction of the recombinant plasmid, 

pTrcSCSFmll 

The signal sequence of endoxylanase derived from 
Bacillus sp. was used for the secretion of hG-CSF protein 
from E. coli. To join the signal sequence of endoxylanase. 
to the N-terminal of hG-CSF polypeptide by PCR, primer 9: 
5 1 -GGAATTCACATGTTTAAG TTTAAAAAGAAATTC-3 ' ( SEQ ID NO: 10), 
primer 10: 5 • -GGCTGGACCTAACGGAGTTGCAGAGGCGG-3 ' (SEQ ID NO: 
11) and primer 11 : 5 ' -GCAACCGCCTCTGCAACTCCGTTAGGTCCAGCC- 
3' (SEQ ID NO: 12) were prepared, respectively. From PCR 
performed using primers 9 and 10, and endoxylanase gene- 
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carrying plasmid pKJX4 (.s^: Jeong et al.. Enzyme Microb 
Technol. 22 (7) : 599-605 (1998 ) ) as a target DNA, PGR product 
containing endoxylanase signal sequence was obtained, and 
from PGR performed using primers 2 and 11, PGR product 
containing hG-CSF gene was obtained. Again, PGR was 
performed using two PGR products obtained above, and 
primers 2 and 9 to obtain PGR product containing 
endoxylanase signal sequence-fused hG-CSF gene, which was 
then digested with two restriction enzymes, ^^i^lIII and 
BamRl. Also, the plasmid pTrc99A (Pharmacia Biotech Go., 
U.S.A.) carrying a strong inducible promoter, trc promoter, 
was digested with Ncol and BamHI, and then ligated with the 
above two PGR products. The ligation mixture was 
transformed into E. coll XL1--Blue. The transf ormants were 
selected on the LB agar medium containing ampicillin (50 fig/ 

1 ) and the recombinant plasmid pTrcSCSFmll was obtained 
therefrom (.SS^: Figure 8) . . The nucleotide sequence of the 
N-terminal portion of hG~CSF gene was determined using an 
automatic DNA sequencer (ABI Prism model 377, Perkin Elmer 
Co., U.S.A.) (s^: Figure 9). 

In order to analyze . the .secretion of hG-CSF protein 
in various strains of coli, the recombinant plasmid 

pTrcSCSFmll was transformed into E, coll BL21(DE3), E. coll 
HBIOKF- hsdiS20(rk-r mk-) _ recA13 ara-14 proA2 lacYl galK2 
rpsL20(strr) xyll-S mtl-1 supE44 A -) , E. coll MC4100(F- 
araDlSB A (argF-lac) U169 rpsLl50 (strr) relAl flhB5301 daoCl 
ptsF25 rbsR) and E. coll W3110 (derived from E. coll K-12, 

A. F-, prototrophic) . Each transf ormant was inoculated 
into 50mi aliquots of LB medium containing ampicillin ( 50 /zg 
/Z ) , respectively, to perform the experiments for 
measuring expression level and protein analysis by the 
method described in Example 2. After induction with ImM 
IPTG, however, the cells began to lyse abruptly, and within 

2 hours most cells were lysed. The cultures were 
centrifuged to obtain transformed jE. coll cells and 
proteins were analyzed using SDS-PAGE to find out no 
protein corresponding hG-CSF protein was obtained. 
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Example 5 : Construction of the recoinbinant plasmid, 

pTrcKCSFmll 

Considering the fact that ampicillin resistance 
gene (Apr) in plasmid pTrcSCSFmll is coding for P -lactamase 
which is secreted to the periplasm of E. coli, it may be 
conjectured that there was a competition between JP - 
lactamase and hG-CSF protein for secretion, and such 
competition brought about cell lysis observed in Example 4. 
Based on such hypothesis, kanamycin resistance gene (Kmr) 
was employed instead of ampicillin resistance gene (Apr) as 
a selection marker since kanamycin resistance gene encodes 
the non-secreted protein, still can be used as a selection 
pressure. In order to substitute ampicillin resistance 
gene (Apr) in plasmid pTrcSCSFmll with kanamycin resistance 
gene (Kmr), PCR was performed using primer 12: 5'- 
GCGAATTCTTTAAAGCCACGTTGTGTCCTCAAA-3' (SEQ ID NO: 13) and 
peimer 13 : 5 ' -GCGAATTCTTTAAATTAGAAAAACTCATCGAGCATC-3 ' ( SEQ 
ID NO: 14), and plasmid pACYCl77 as a target DNA. The 
resulting PCR product containing kanamycin resistance gene 
was digested with Dral, and then ligated to the plasmid 
pTrcSCSFmll which was partially digested with Dral to 
remove ampicillin resistance gene. The ligation product 
was transformed into E, coli XLl-Blue- The transf ormants 
were selected on LB agar medium containing kanamycin (25 ;^g/ 
Z ) and the recombinant plasmid pTrcKCSFmll was obtained 
therefrom (see: Figure 10) • 

To analyze the secretion of hG-CSF protein, the 
recombinant plasmid pTrcKCSFmll was transformed into E. 
coli BL21(DE3), E. coli HBlOl, E. coli MC4100 and E, coli 
W3110. Each transf ormant was inoculated into 50ml aliquots 
of LB medium containing kanamycin (25 jWg/£ ), respectively, 
to perform the experiments for measuring expression level 
and protein analysis by the method described in Example 2, 
and same result was obtained as in Example 4 wherein 
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pTrcSCSFmll was used. That is, with pTrcKCSFmll also, 
cells were lysed excessively upon induction with ImM IPTG 
and SDS-PAGE analysis of recovered protein showed no 
protein corresponding to hG-CSF protein. From the above 
studies, it has been found that the substitution of 
ampicillin resistance gene with kanamycin resistance gene 
exert no effect on avoiding cell lysis, therefore, another 
genetic manipulation which can solve the intrinsic problem 
of cell lysis caused by secretion of hG-CSF protein is 
necessary. 

Example 6 : Construction of the recombinant plasmids, 

pTHSCSFmil and pTHKCSFmll 

To facilitate E. coli to secrete hG-CSF protein 
without resulting in cell lysis, the strategy of inserting 
a small oligopeptide between endoxylanase signal sequence 
and hG-CSF protein was employed. For this purpose, PGR was 
performed using primer 14: 5'- 

CACCATCACCATATCGAAGGCCGTACTCCGTTAGGTCCA-3' (SEQ ID NO: 15) 
and primer 15: 5'- 

GATATGGTGATGGTGATGGTGCGGGCCAGCTGCAGAGGCGG-3 ' (SEQ ID NO: 16)., 
and pEDSCSFmll as a target DNA. First, PGR with primers 14 
and 2, and PCR with primers 15 and 9 were performed, 
respectively. Then, the two PCR products were mixed to be 
subjected to another PCR using primers 2 and 9, and the 
resulting product was digested with Ajfllll and BamHI to 
join into the Ncol/BamRl site of plasmid pTrc99A. After 
introducing the ligation product into JE;. coli XLl-Blue, the 
transf ormants were selected on a LB agar medium containing 

ampicillin (50 /^g/£ ), and the recombinant plasmid pTHSCSFmll 
was obtained therefrom (.aa^: Figure 11) . Figure 12 shows 
the nucleotide sequence and deduced amino acid sequence of 
the inserted DNA in the plasmid pTHSCSFmll, wherein the 
italic indicates the endoxylanase signal sequence and the 
bold indicates inserted oligopeptide sequence. 

As similarly as in Example 5, in order to substitute 



WO 01/73081 



PCT/KRO 1/00549 



1 6 



10 



ampicillin resistance gene (Apr) with kanamycin resistance 
gene(Kmr), the plasmid pTHSCSFmll was partial digested with 
Dral to remove ampicillin resistance gene (Apr) and then 
ligated with DNA fragment carrying kanamycin resistance 
gene obtained by digestion of pTrcKCSFmll with Dral. The 
ligation product was transformed into E. coll XLl-Blue. 
The transf ormants were selected on. LB agar medium 
containing kanamycin ( 50 //g/^ ) and the recombinant plasmid 
pTHKCSFmll was obtained theref rom (.as^: Figure 13) . 

Example 7 : Secretion of hG-CSF fusion protein from the 

transf oinued E. coli carrying the recombinant 
plasmid pTHKCSFmll 



15 



20 



25 



30 



To analyze the expression . and secretion of hG-CSF 
fusion protein from pTHKCSFmll, the recombinant plasmid 
pTHKCSFmll was transformed into E. coli BL21(DE3) , E. coll 
HBlOl, E. coll MC4100 and E. coll W3110, respectively- 
Each transformant was cultured in 50ml aliquots of LB 

medium under temperatures of 37 *C and 30^0, respectively, 
followed by analyses of secretion of the hG-CSF fusion 
protein as described in Example 2. After 4 hour induction 
of hG-CSF gene expression with ImM IPTG, 1ml aliquots of 
culture broth were centrifuged to obtain the cells, which 
were then analyzed by SDS-PAGE for the expression of hG-CSF 
fusion protein to show the secretion of hG-CSF fusion 
protein from all the transformed E. coll. The contents and 
the secretion efficiencies of hG-CSF fusion protein 
secreted from the each transformed E. coll are shown in 
Table 1. 



Table 1 : Comparison of production and secretion of 

hG~CSF in recombinant coll 



Host Cell 


Content of hG-CSF (%) 
in Total Protein 


Secretion of 
hG-CSF (%) 


. — , — 

Culture Time 


30°C 


37°C 


30°C 


37°C 


coll BL21 (DE3) 


22.7 


22.1 


>98 


>98 


£. coll HBlOl 


13.5 


12. 8 


81 


75 



L 
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E. coli MC4100 


22 . 1 


20.8 


>98 


>98 


E. coli W3110 


10.5 


10.0 


77 


65 


E. coli XLl-Blue 


9.4 


8.8 


56 


51 



To examine if the secreted hG-CSF protein has been 
processed correctly, i.e., if the signal sequence has been 
removed correctly, hG-CSF fusion protein was isolated from 
5 the gel and its N-terminal amino acid sequence was 
determined to be N ' -Ala-Gly-Pro-His-His-His-His-His-His- 
Ile-Glu-Gly-Arg-Thr-C , which is in agreement with the 
deduced amino acid sequence of N-terminal po-rtion of hG-CSF 
' fusion protein, indicating that the hG-CSF fusion protein 

10 was successfully secreted from E. coli. Of two temperature 
conditions, all the transf ormants showed higher secretion 
efficiency and higher hG-CSF protein content at 30 than 
37t;. Of 5 strains of transformed E. coll, BL21{DE3) and* 
MC4100 showed the highest production yield. Thus, E. coll 

15 MC4100 transformed with the recombinant plasmid pTHKCSFmll 
was named E. coll MC4 100 /pTHKCSFmll, which was deposited 
with the Kor^-an Collection for Type Cultures (KCTC) 
affiliated to Korea Research Institute of Bioscience and 
Biotechnology (KRIBB) , an international depository authority, 

20 under accession (deposition) No. KCTC 0754BP on Mar. 13, 
2000. 

The hG~CSF fusion polypeptide expressed in the 
transformed E. coll MC4 100 /pTHKCSFmll was processed during 
the secretion process to the periplasm through 

25 innermembrane to remove endoxylanase secretion sequence, 
resulting in secretion of oligopeptide/hG-CSF fusion 
protein. The hG-CSF fusion protein can be isolated easily 
by using a nickel column due to 6 histidine residues in N- 
terminal portion of the fusion protein, and C-terminus of 

30 the oliopeptide in the fusion protein can be recognized by 
Factor Xa and the oligopeptide is cleaved off. to give a 
complete hG-CSF protein. 



As clearly illustrated and demonstrated above, the 
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present invention provides an E. coll producing and 
secreting human granulocyte-colony stimulating factor (hG- 
CSF) , a plasmid vector therefor^ and a process for 
producing complete hG-CSF protein with high purity from the 
protein pool secreted by the said microorganism. In 
accordance with the invention, the hG-CSF protein can be 
prepared with high purity through rather simple process 
facilitating secretion of large amount of hG-CSF fusion 
protein into the periplasm, which does not require 
complicated processes such as solubilization and subsequent 
refolding required for isolation of the hG-CSF protein- 
produced in cytoplasm as insoluble inclusion bodies by 
conventional techniques, thus, the hG-CSF protein can -be 
widely used as an active ingredient in the development of 
supplementary agents for anticancer therapy. 

It will be understood that the above description is 
merely illustrative of the preferred embodiment and it is 
not intended to limit the scope of the invention to the 
particular forms set forth, but on the contrary, it is 
intended to cover such alteranatives, modifications and 
equivalents as may be included within the spirit and scope 
of the invention as defined by the claims. 



wo 01/73081 



PCT/KRO 1/00549 



1 9 



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 
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On page 7 , lines 15=25 - 



B. IDENTIFICATION OF DEPOSIT 
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Korean Collection for Type Cultures (KCTC) 
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Korea Research Institute of Bioscience and Biotechnology(KRIBB) 
#52, Oun-dong, Yusong-ku, 
Taejon 305-333, 
Republic of Korea 
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Mar. 13, 2000 



Accession Number 
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